PROJECT CORES: AN OVERVIEW

This symposium introduces the use of physiological responses, such
as galvanic skin response, heart rate, and respiration rate, as
indices of involvement in simulation. The technique gives the
developer of simulations a useful tool for assessing the impact of
his creations. Uses in both the formative and summative domains of
evaluation are possible.

Furthermore, the project postulates a comprehensive systems model 
of simulation and its components. Through dialog and further research 
the relationship among the components will be clarified, thereby 
aiding the development and application of new simulations. 

HISTORY & BACKGROUND OF PROJECT CORES 
The project started as three individual dissertation efforts
for Gene Poor, Richard Coulter, and myself. As a result of what I
assume to be a typical guidance and counseling process, two things
became clear:

A. That the field of educational administration suffered from 
a surfeit of surveys and role studies. 

B. That there was a need for more knowledge centering on the 
use of simulation techniques in educational administration. 

Once these two ideas crystallized, we each launched into our initial
review of the literature. The original review of the literature in the
field of simulation indicated very little experimentally derived
and substantiated evidence as to the effect of simulations. Hunches
and intuitive interpretations, however, were rampant. Additional research
also pointed out that the mechanisms and variables of simulation systems
were equally unsupported and unclarified by empirical research.

The next crucial step was the identification of the possibility 
of using physiological indices as indicators of involvement. From 
there each of us went our separate routes in defining our problems 
and research. We did, however, collectively pursue some funding from
the graduate school at Bowling Green State University. The resultant
funding, in conjunction with our procurement capabilities, allowed us
to proceed in the manner to be detailed.

A coordinated project was initiated by Bowling Green State Univer- 
sity's Division of Educational Administration. The project's ongoing 
purposes are to define important variables in the simulation system 
and to develop implementations thereof. Three initial investigations 
comprised the first phase of the project: 

I. Dr. Gene Poor's investigation of the simulation vehicle, i.e.,
the media used to convey the simulation program content to
the simulation participant (1973).

II. Dr. Richard Coulter's investigation of the relationships
between simulation responses, background, and experiential
variables (1973).

III. Dr. Michael Dyrenfurth's investigation of the relationships
between the simulation participant's physiological reactions
and the effect of the simulation (1973).

IV. Several additional studies presently being pursued will
be described by Dr. Bill Reynolds.

THE STUDIES' COMMON CORE

Because we three were the first at BGSU to pursue this particular
thrust, a significant portion of our effort was devoted to establishing
the hardware necessary for the research effort. Four major areas of
commonality need to be described: the simulator, the program, the
sample, and the physiological measures.

Simulator

The dynamic administration simulator (DAS) depicted in figure 1
is a cubicle approximately seven feet square. Built to resemble a
typical office, the DAS was also designed as a tool to achieve
environmental control of sound and temperature. The walls are
attractively panelled and treated with internal insulation. The suspended
ceiling consists of insulating tile and contains lighting and
ventilation provisions. The floor is carpeted.

The contents of the DAS include: a desk, a desk chair, and a 
guest chair. One wall, directly in front of the desk, is fitted with
a large rear projection screen that is flanked by a one-way mirror and
a set of stereo speakers.

The equipment used for injection of the program into the DAS 
includes a 16mm film projector, two 35mm slide projectors, a telephone, 
an audio system, and light level controls. The output and monitoring 
equipment used in the first generation DAS included a telephone, a 
keyboard response panel, a one-way mirror, and physiological monitoring
apparatus.

Program 

The term "program" refers to the sequence and set of inputs
describing the simulated situation. These inputs were visually
projected using either 16mm color film or 35mm color slides. Audio was
provided by both a magnetic soundtrack on the film and by the telephone
on the subject's desk. At key times during the program, pauses were
introduced to allow the participant to respond to the problem incident
as he chose. Each response consisted of a choice selected from the
alternatives projected on the screen. The participant indicated his choice
by pressing the appropriate button on the response panel on his desk.

The program used in this study was based on and developed from
selected incidents produced by Dr. J. Reynolds. The original
set of incidents, that is, occurrences within the principal's day, were
part of the Alta Vista Package. As conceived by Dr. Reynolds, Alta
Vista is intended to be a "total simulation", that is, a simulation
with a large-scale, wide-angle, multiple-perspective view of a school
system's operation. A script dialog was then developed to convert
the incidents into action. Subsequently, with the help of an amateur
theatrical group, the script was filmed to yield the visuals
required for projection.

The program's story line involves the experiences of a secondary
school principal as he handles a morning of events, all intimately
involved with a teacher walkout. To compound the situation, the
principal is expected to carry out the superintendent's policy of keeping
the school open. Further difficulties include an injured student,
conniving teachers, and union pressures. After each participant is
steeped in the situation, he is presented with a set of five
alternatives and is suddenly faced with the necessity of selecting an
appropriate response.

The Population and Sample

The population was defined to include those educational admini- 
strators in the northwest section of Ohio that cluster around Bowling 
Green State University. Educational administrator was defined to mean 
all those presently involved in, or preparing for, administration or 
administration and teaching in public primary, secondary, and post- 
secondary institutions. It was assumed that students registered in 
the administrative courses offered by Bowling Green State University 
were representative of this population. 

Because of the nature of physiological research and because of
the general unfamiliarity of educators with the techniques employed,
it appeared difficult to resort to the more common and respected
random sampling techniques. Accordingly, the sampling method used
was to call for volunteers from the above-mentioned classes. An
attempt was made to obtain a total sample size of 35. The actual
usable sample size was 32.

Furthermore, the difficulty, or perhaps even the impossibility, of
establishing a control group without any treatment precluded the use
of a control group design. It was recognized that the chosen design
limited the extent of inferences based upon the study's findings.
However, because of the exploratory nature of the investigation, as well
as the need for a study more inclusive than a strictly experimental
investigation of just one variable, the restrictions on inference were
considered less important than the advantages gained by the broad
look at activity.
Inprocess Physiological Instrumentation

Overview

The inprocess measures consisted of continuous monitoring of 
two basic indicators of physiological activity: galvanic skin po- 
tential (GSP) and heart rate (HR). Each measure was obtained through 
the use of electrodes sensing the electric signals associated with 
the physiological phenomena concerned. Non-clinical electrode
positioning was employed. Skin preparation followed the recommended
procedure, namely slight skin abrasion (decornification) for the GSP
reference electrode and alcohol cleansing for the others.

Selection of Measures

It was necessary to select physiological indicators that would 
provide clear and relatively unambiguous information as to a subject's 
covert reactions to simulation. Furthermore, it was important that 
these measures be available continuously during the treatment. 

The major physiological variables associated with the arousal
and emotions generated by stressful situations can be categorized as
pertaining to (a) electrical properties of the skin, (b)
characteristics of the cardiovascular system, and (c) respiratory phenomena
(Grossman, 1967).

Because of the exploratory nature of this study, it was decided 
to focus on the three major categories of variables listed by Grossman. 
A further review of the physiological and psychophysiological litera- 
ture was conducted to determine which measure to use in each category. 
This review indicated that, in order to obtain a reliable measure,
more than one indicator had to be used (Sternbach, 1966).
The researcher therefore set out to use three measures, one
pertaining to each of Grossman's (1967) major categories. However,
the review indicated (Schnore, 1959) that not all physiological
measures would be equally useful in differentiating between
participant activity levels. Perhaps the lesser usefulness, for this study
at least, of respiratory measures in differentiating emotional level
is due to the fact that they are partially subject to voluntary control
(Grossman, 1967). Because of these findings it was decided to forego
the use of any respiratory measure.

Fortunately, convenient measures in each of the remaining two
major categories met the criterion. Grossman (1967) claims that
"visceral responses," which include HR, GSR, and GSP, are not subject
to voluntary regulation. Thus it remained only to select the actual
measures to be used as indicators of the activity of (a) the
electrical phenomena of the skin and (b) the cardiovascular system. With
respect to the former, GSP was selected despite the fact that GSR is
the most widely used index (Lader et al., 1966). The initial reason
for choosing GSP was that GSR could not be measured with the apparatus
available for this investigation. This academically unsupportable
reason was fortunately augmented by the findings of Lader et al. (1966),
who reported that the "psycho-galvanic response," their generic name
for electrical skin phenomena, can be obtained using either GSR
or GSP methods.

Of the several measures available to indicate cardiovascular
activity, HR was selected for use in this study. While, as indicated
previously, multiple physiological measures were necessary to obtain
a reliable indication of activation, caution had to be exercised,
since two highly correlated measures would not augment reliability.
HR is reported as being independent of palmar conductance. Palmar
conductance is the reciprocal of GSR, a measure highly correlated
with GSP (Lader et al., 1966, citing Lacey and Lacey, 1958).
Furthermore, just as Shapiro et al. (1964) reported on the
usefulness of the GSP as an index of the degree of activation, Schnore (1959)
clearly includes HR with the variables he labels as consistent
differentiators between high and low arousal levels.

Details of the Technique

While some of the specific details of the instrumentation
procedures could be determined by pilot runs, many other
details were abstracted from the literature. HR, for example, presented
little problem in electrode placement. The signals associated with
this activity are so strong that they are easily obtainable from many
parts of the body. Electro-dermal phenomena, however, are not as
easily measured. Much argument exists over proper techniques (Lader
et al., 1966). Venables and Sayer (1963) were of major assistance
in determining the specific palm and arm locations of the GSP electrodes.
They also recommend skin abrasion for the inactive site.

Apparatus

The inprocess measures were recorded using a Beckman Model RB
Dynograph with four available channels and a time event marker.
Signals were sensed by Beckman miniature skin electrodes using a
commercial paste. Both the participant's physiological reactions and
his choice of response to the problem, entered by means of a home-built
indicator box, were recorded by this apparatus.
The physical design of the apparatus kept connections and wiring
as inconspicuous as possible. The connections and wiring
were arranged so as to allow the participant some freedom of movement
within his chair. Other than the electrodes and wires, all actual
instrumentation was mounted outside the DAS and was not seen by
the participant until after the experiment. All experimentation took
place in a shielded room with considerable acoustical control.
Environmental variables (temperature and humidity) were controlled
by the plant's air-conditioning system.

Reliability of the Inprocess Measures

With the basic physiological procedures selected, two sets of
pilot runs were conducted to determine such things as repeatability,
rate of adaptation (the tendency for a stimulus to produce less of a
response near the end of an experimental period than at the beginning
of the period), and the exact technique and locations to employ.

Control of motion artifact was crucial to the interpretation of
both HR and GSP. After some trial and error cycles, a set of
electrode positions exhibiting a minimum of motion artifact was
discovered. The preferred positions for the two GSP electrodes were:
(a) the left palm, on the fleshy part at the base of the thumb,
near the point where the axes of the thumb and first finger intersect,
and (b) the left upper arm, about two to three inches above the elbow
on the inside surface. HR was sensed using three electrodes: one
reference electrode on the right upper arm, about two to three inches
above the elbow on the inside surface, and two active electrodes,
one each left and right, high on the upper arm's outside surface,
about two inches below the shoulder. It was not possible to
completely control motion artifact by electrode placement and recording
methods, but the remainder was controlled by means of the
analytical procedures employed.

One additional fact was determined during the pilot runs. The
independence of the two channels was demonstrated. There was no
observable distortion in channel one (GSP) when channel two (HR)
was introduced, and vice versa.

The second set of pilot runs investigated the reliability of the 
GSP and HR measures in terms of the experimenter's consistency in 
electrode placement and application, instrument operation, and trace
interpretation. Because the "arousal potential" of a simulation is
less each time a participant repeats the exposure, the repeated 
measures during treatment method of determining reliability could not 
be used. Consequently, reliability was determined using a no-stimulus 
method. It was assumed that because the only difference would be in 
the level of response, a suitable indicator of reliability would 
result. Each volunteer, N = 14, was subjected to three sessions that 
paralleled the experimental procedures except that no form of stimulus 
was given. This set of pilot runs consisted of attaching GSP and HR 
electrodes to the volunteer and then seating him in the DAS. The 
volunteer remained there for ten minutes, during which time no stim- 
uli of any sort were injected. This process was repeated for each 
of the three sessions. 

Of the three sessions, only the second and third were scored.
The first, which was considered an acclimatization session, was
discarded to avoid the novelty effect artificially lowering the
reliability estimates. The remaining two sessions yielded two
charts, or recordings, for each participant. Each chart was then
scored to yield indices for HR, galvanic skin potential
amplitude (GSPA), and galvanic skin potential frequency (GSPF).

Each of the indices obtained was then plotted to determine
significant trends. This was done visually, primarily to inspect
for evidence of adaptation. Numerical inspection for adaptation
was done by using correlated t tests to examine the differences
between beginning and ending means. From these data, it was
determined that there was no significant adaptation.
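
A minimal sketch of the numerical check for adaptation, assuming that the correlated test referred to above is the ordinary paired (dependent-samples) t test applied to each volunteer's beginning-of-session and end-of-session means. The function name and the heart-rate values are hypothetical and serve only to illustrate the arithmetic; they are not data from the pilot runs.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t(begin, end):
    """Return (t, df) for a paired t test on begin-minus-end differences.

    begin, end: equal-length lists holding one value per volunteer.
    """
    if len(begin) != len(end):
        raise ValueError("both lists need one value per volunteer")
    diffs = [b - e for b, e in zip(begin, end)]
    n = len(diffs)
    # Standard error of the mean difference.
    se = stdev(diffs) / sqrt(n)
    return mean(diffs) / se, n - 1


# Hypothetical heart-rate means (beats per minute) for five volunteers.
begin_hr = [72, 75, 70, 68, 74]
end_hr = [71, 76, 69, 68, 73]
t_value, df = paired_t(begin_hr, end_hr)
print(f"t = {t_value:.2f} with {df} degrees of freedom")
```

A t value that falls short of the critical value for the chosen significance level would be read, as in the text, as no significant adaptation.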

The data were also used to correlate the participants' run
two scores with their run three scores. A separate Pearson
product moment correlation was calculated for each of the following
indices: HR, GSPA, and GSPF. Because of the small sample size,
Spearman's rank order correlation was also calculated. In
addition, a scattergram was plotted for each correlation to check
linearity. This condition appeared to be satisfied. Table 1
presents the results of the reliability correlations. It was
concluded that the correlations were strong enough to warrant use of
the developed technique.

Table 1

Reliability Correlations, Pilot Run Two Versus Run Three

Physiological               Correlations (a)
Indices          Pearson r    t (b)    Spearman r    t (b)

GSPA                .88       5.86*       .75        3.59*
GSPF                .85       5.10*       .78        3.94*
HR                  .73       3.38*       .68        2.93*

a. N = 12; the responses of two participants were discarded as
mavericks.
b. Calculated using the t test given by Tate (1955, p. 469).
* Significant at the .01 level, one-tailed test.
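
The t values in Table 1 are consistent with the usual test of a correlation coefficient against zero, t = r * sqrt((N - 2) / (1 - r^2)), with N - 2 degrees of freedom. Whether this is exactly the form given by Tate (1955) is an assumption; the sketch below only shows that the formula, applied with N = 12, reproduces the tabled values. The function name is illustrative.

```python
from math import sqrt


def t_for_correlation(r, n):
    """t statistic for testing a correlation r from n pairs against zero."""
    return r * sqrt((n - 2) / (1 - r * r))


N = 12  # usable volunteers in the reliability pilot (Table 1, note a)

# Correlations reported in Table 1: (index, Pearson r, Spearman r).
reported = [("GSPA", 0.88, 0.75), ("GSPF", 0.85, 0.78), ("HR", 0.73, 0.68)]

for name, pearson, spearman in reported:
    t_p = t_for_correlation(pearson, N)
    t_s = t_for_correlation(spearman, N)
    print(f"{name}: Pearson t = {t_p:.2f}, Spearman t = {t_s:.2f}")
```

Applying the same t formula to Spearman's coefficient is an approximation, and with only 12 pairs it is a rough one.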
AN INVESTIGATION OF THE RELATIONSHIP BETWEEN INVOLVEMENT
IN SIMULATION, AS EVIDENCED BY PHYSIOLOGICAL INDICES, AND
CHANGE IN CONCEPT MEANING (1)

Introduction

It was the intent of this study to investigate simulation and attempt
to identify some crucial factors, an awareness of which is necessary in 
order to develop and utilize the technique effectively. The purpose 
of the study can, therefore, be restated as seeking to contribute to 
the knowledge base of simulation theory and application. 

When assessing the effects of simulation, researchers have usually
concentrated on the quantity of learning as a criterion. Quantity, as
an output measure, was frequently determined using standardized tests
or researcher-developed tests. In either case, because of their
unidimensionality, that is, because the tests were developed or selected to
measure a particular thing, their sensitivity to fine changes in the quality,
not necessarily the quantity, of knowledge or cognitive structure is suspect.
Perhaps another technique would be more sensitive? Because of its
multi-dimensionality, it was postulated that concept meaning, as measured by the
semantic differential technique (Osgood, Suci, & Tannenbaum, 1967),
fulfilled the need for a more sensitive output measure.

Proponents of the simulation technique often imply that involvement
is one, if not the, significant aspect that results in participant learning
(Beck & Monroe, 1969a; Cruickshank & Broadbent, 1968; Twelker, 1969).
This claim for involvement is related to such other claims as relevance,
realism, activity, and reward. By virtue of its centrality to the
educative applications of simulation, it was proposed that the claim of
involvement was worth investigating.

(1) Michael Dyrenfurth, Chairman, Department of Industrial Education,
Valley City State College, Valley City, ND 58072.



Involvement, if it in fact exists, by definition must occur DURING
the simulation experience. Furthermore, if involvement occurs during the
process of simulation, then it seems more appropriate and direct to measure
it during the process than after the fact with retrospective questions and
self-reporting instruments. It follows, then, that if this involvement is
generally inherent in simulation participants, it will be demonstrated
by their physiological reactions during the simulation. The problem
investigated stemmed from this premise.

The Problem

The general problem was to determine the relationships between involve- 
ment, as manifested by physiological activity during simulation, and change 
in concept meaning. 

Simulation participants with a high level of physiological activity
during simulation will exhibit more change in concept meaning than
participants with a low physiological activity level. In testable form, it can be
stated that there will be a positive relationship between physiological activity
during simulation and change in concept meaning.

The study's significance rests on its contribution to the knowledge of 
what occurs during simulation and how this relates to the technique's effects. 
Acceptance of the research hypothesis would provide the developer of future 
simulations with a criterion for determining his creation's usefulness in 
affecting the relevant factor structures of his participants and this may 
lead to a more effective way of evaluating the outcomes of simulation. 



THE PROCEDURES 

This study employed an experimental group, N = 32, to obtain the data
necessary to test the hypothesis. The experimental group was subjected
to pre- and post-treatment measures of concept meaning. The treatment
was the simulated situation. During the simulation the participants'
physiological responses were continuously monitored and recorded. The
ultimate test of the problem was pursued through correlational statistics.
Participant change in concept meaning was correlated with the level of
physiological activity as measured by the various physiological indices.
This allowed determination of an overall index of relationship, which was
predicted to be significant and in the positive direction.
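
The correlational test described above can be sketched as follows. The text does not state how change in concept meaning was scored, so this example assumes one common choice for semantic differential data, the profile distance D (the square root of the summed squared differences between pre- and post-treatment scale ratings). The function names and all numbers are hypothetical.

```python
from math import sqrt


def profile_distance(pre, post):
    """Distance between two semantic differential profiles (assumed D score)."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(pre, post)))


def pearson_r(x, y):
    """Pearson product moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical data: 7-point ratings on four scales for three participants,
# plus one physiological index per participant (e.g. mean HR in the session).
pre_ratings = [[4, 5, 3, 4], [2, 6, 5, 3], [5, 5, 4, 4]]
post_ratings = [[4, 6, 3, 4], [4, 4, 6, 2], [5, 5, 4, 5]]
physio_index = [78.0, 91.0, 74.0]

change = [profile_distance(a, b) for a, b in zip(pre_ratings, post_ratings)]
print("change scores:", [round(c, 2) for c in change])
print("r =", round(pearson_r(physio_index, change), 2))
```

A positive r of sufficient size would support the research hypothesis; with the one-group design used here it indicates association only.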

A modified single-group quasi-experimental design was employed,
namely, a one-group pretest-posttest design (Campbell and Stanley, 1963).
The design was modified by superimposing an observation (O) on the
treatment (X), that is, providing observation during treatment. This
alteration was necessary because of the need to indicate concurrent
treatment and monitoring.

The simulator and its program have already been described as have 
the inprocess measures of physiological activity. 

The participants in the experimental group underwent treatment involving
the simulation of selected aspects of a secondary school principal's activity.
A systems paradigm of this simulation and its context is provided by figure 1.
Three inputs formed the basic resources of the system. Two of these, the
program and the simulator, combined to form the model abstracted from reality.
The participants, as the third input, interacted individually with the model.
This interaction became the actual simulation.

Output Instrumentation

A review (Dyrenfurth, 1973) indicated that a major weakness of simulation
evaluations consisted of using criterion measures when some justified
questions might be raised as to which criteria should be employed (Cherryholmes,
1966). The question of which criteria, if any, were to be used in this
investigation was equally difficult to resolve. In fact, under the existing
conditions of uncertainty as to what effect the simulation would have, the
first step was to establish the fact that something had changed as a
result of the participant's exposure to simulation.

A possible second step, one based on a specific set of values, could
then define the changes that occurred as "better" or "worse" than others.
It was the researcher's contention, however, that the present state of
knowledge pertaining to simulation did not permit the second stage to
be applied. Accordingly, only the existence of change and its amount were
dealt with in this study.

Change, as mentioned in the preceding paragraph, requires elaboration
and clarification. Change of what? Attitude? Knowledge? Again, the
determining factor was the formative stage the investigator found himself
in. While change of attitude towards the principalship, for example, could
have easily been justified on the basis of the program content, it was felt
that this would repeat the error identified by the review, the error being
the selection of a particular criterion while using an inadequate basis of
operation. As a result of this decision a composite measure was sought,
i.e., one that would encompass change of many kinds. It was mandatory that this
measure, for example, be able to detect both cognitive and affective change
that might result from the simulation experience. Furthermore, the
measure had to be able to independently detect the existence of change along
any domain. Also, since the magnitude of the change to be measured was
not known, the instrument had to be sensitive enough to signal small changes
should they occur. Another instrument characteristic, suggested by the
findings of Robinson (1967), Algor (1967), and Hnssi et al. (1967),
was indirectness, to avoid the tendency toward faking responses.

Thus, a summary of the essential characteristics, abstracted from the
review, of the instrument used to measure the outputs of simulation yielded
three main points. The instrument must measure change in differing domains.
It must be able to detect changes ranging from small to large and from
specific to non-specific. The third characteristic was that the instrument be
of an indirect, non-obvious type.

Instrument selection. A review of the traditional sources as well
as some newer textbooks (Fox, 1969; McGrath, 1970; Englehart, 1972)
suggested several techniques, the most promising of which, because of its
flexibility and the amount of supporting research, appeared to be the
semantic differential technique. Support for this particular choice can
be drawn from the study by Rell (1971), who used the semantic differential for
similar purposes, namely, the measurement of change caused by simulation,
and from Brod, Kernoff, & Terwilliger (1969), who used a semantic
differential instrument in situations with a high level of anxiety.

The methodology of the technique is extensively detailed by Osgood et
al. (1967) in their book The Measurement of Meaning and therefore will
not be dealt with here. Osgood's technique permitted a wide range of
concepts to be used. To maintain the broad perspective that was so
essential to this initial investigation, it appeared that a minimum of two
concept areas must be measured. These two areas contained the majority
of the participant's experiences that occurred during this study.

The first area pertained to the technique of simulation. A semantic
differential instrument using the concept of simulation techniques for
educational purposes had the subjects describe the meaning of this concept
before and after the treatment. However, measuring the participant's meaning
of simulation was not enough. It seemed reasonable to maintain that the
participant's view of the content presented by the simulation might have
changed as a result of his being exposed to it while experiencing some
stress and anxiety. Since the concept of simulation techniques for educational
purposes was pertinent only to the technique employed, a measure of content
was also required. Thus a second semantic differential instrument was
developed to determine any change in the meaning of the content presented
during the simulation.

Instrument development. A pilot version of the instrument was initially
developed using both concept and scale selection procedures suggested by
Osgood. The two main criteria involved factorial composition and relevance.
For the pilot instrument, relevance was judged using the investigator's
subjective opinion. Initial factorial composition was determined from a
review of various studies selected from Osgood's book. In addition, several
adjective pairs (scales) were developed by the investigator specifically for
this study. The end result was a six-concept semantic differential instrument
encompassing the two major areas of investigation, each at three levels of
specificity. Thirty-six scales were used and were systematically
distributed with respect to the concepts and origin of the scale.

The pilot instrument was administered to a group of 118 students
with characteristics similar to those of the sample. The results of this
administration were factor analyzed using the Biomed X72 program (Varimax
rotation). The resulting factor loadings served as the basis for making
the choice as to the 54 scales used in the final experimental instrument.

Reliability of the output measures. Split-half reliability of the
pilot instrument was calculated by randomly dividing the scales loading on
specific factors into two equal groups and then correlating the scores on
these scales for each half. This procedure was repeated for each major
set of concepts, simulation and conflict, and for each of the first three
factors. Because the split-half reliability method typically results in a
low estimate (Fox, 1969), the estimated reliability was adjusted using the
Spearman-Brown prophecy formula. Table 1 presents the results of these
calculations. Very clearly, the desired and necessary reliability was achieved.
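
The split-half procedure and the Spearman-Brown adjustment can be sketched as below. The adjustment for a test of doubled length is r(adj.) = 2r / (1 + r). The data layout (one list of scale ratings per respondent), the function names, and the use of summed half scores are assumptions made for illustration and are not taken from the study.

```python
import random
from math import sqrt


def pearson_r(x, y):
    """Pearson product moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r_half):
    """Step a split-half correlation up to the full-length reliability."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(ratings, seed=0):
    """Randomly split the scales in two, correlate half scores, then adjust.

    ratings: one list per respondent, holding that respondent's ratings on
    the scales that load on a single factor (hypothetical layout).
    Returns (r_half, r_adjusted).
    """
    scales = list(range(len(ratings[0])))
    random.Random(seed).shuffle(scales)
    half = len(scales) // 2
    first, second = scales[:half], scales[half:2 * half]
    score_a = [sum(row[i] for i in first) for row in ratings]
    score_b = [sum(row[i] for i in second) for row in ratings]
    r_half = pearson_r(score_a, score_b)
    return r_half, spearman_brown(r_half)


# An unadjusted split-half correlation of .65 steps up to about .79.
print(round(spearman_brown(0.65), 2))
```

Because the tabled Pearson values are rounded to two places, adjusted values recomputed from them can differ from the tabled adjusted values by about .01.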

Validity of output measures. The validity of the semantic
differential measures, both pilot and experimental, rests largely on the content
validity of the concepts used. A rationale meeting this requirement has
been described in the prior section dealing with the selection of the
concepts to be used. Thus, it was held that the instrument possessed the
needed degree of content validity.

Data Collection Procedures. The volunteers comprising the sample of
this study were asked to schedule a mutually convenient time with the
experimenter. Each experimental session, involving only one subject,
required approximately one hour. When the subject arrived at the research
site, he saw only the outside of the simulator. No equipment of any kind
was visible. The investigator attempted to set the subject at ease. The
subject was then asked to take the pre-treatment semantic differential
instrument, after which the electrodes were applied according to the
procedures developed by the pilot runs. At this time the subject was
reassured that no shock would be given.
Table 1

Pilot semantic differential instrument: Split-half
reliability coefficients for factors obtained by collapsing
across levels of generality

                     Factor one (a)       Factor two (a)       Factor three (a)
Test                 Pearson r  Adj. r(b) Pearson r  Adj. r(b) Pearson r  Adj. r(b)

Simulation (c)
(01/001, 02/001,
03/001)                 .69       .81        .30       .47        .43       .61

Conflict (c)
(04/001, 05/001,
06/001)                 .65       .79        .70       .83        .62       .76

a. All adjusted correlations are significant beyond .005, one-tailed test.

b. Adjusted using the Spearman-Brown prophecy formula, r(adj.) = 2r / (1 + r).

c. N = 118



Treatment involved the experimenter's retreat from the DAS and the
closing of the door. The subject was then told that a two minute silence,
necessary to establish an initial physiological baseline, would precede
the introduction to the simulation. Once the two minute point was reached,
the program switched on. Because of the mechanical nature of the program's
sequencing mechanism, equivalent treatment was assured.

When the program ended the participant was asked to remain at his desk
and relax for an additional two minutes to establish the terminal physiological
baseline. The experimenter entered at the end of this period and administered
the post-treatment semantic differential instrument. Upon completion, the
subject had the electrodes removed.

appears to be some judgment as to the potency of the technique. Factor B
places the meaning of the concept along the dimension of activity. That
is to say, "hot" and "excitable" things are more active than "cold" and
"calm" things. In a similar manner, Factor C, consisting of the scales
flexible-rigid and changeable-constant, locates the concept's meaning
along a dimension that appears to define an ability to adapt. It may
also be that this is a slightly different aspect of the activity dimension.

Principalship as a concept is also defined by three factors; in this
case they were designated F, G, and H to eliminate confusion. Factor F,
composed of the scales important-unimportant, real-unreal, active-passive,
impressive-unimpressive, and powerless-powerful, seems similar to Factor
A. Factor F also includes some measure of activity. Thus, this factor
encompasses all three dimensions and thereby is not as clear and
definitive as A. Factor G is a combination of potency (thick-thin) and
activity (changeable-constant, cold-hot). Factor H is also a combination,
of potency (potent-impotent) and evaluation (fair-unfair). From these
descriptions, it can be seen that Factors A, B, and C define simulation
more cleanly and with less interaction than Factors F, G, and H define
principalship.

Since the purpose of this study was to investigate the relationship
between concept meaning change and physiological activity, the next step
was to quantify the exact change in meaning. A program was constructed
that averaged each participant's responses for the scales comprising each
factor. In addition, group averages were calculated and are indicated in
Table 3. Both pre and post treatment averages were calculated in this way.

A correlated t test was used to test the mean of the entire group's
difference between pre and post treatment scores. This was repeated for
each factor of each concept. Because a difference was expected, a one
tailed interpretation was employed. The differences for the three factors
describing simulation were significant at the .1 level; however, none of
the group differences for the principalship factors could be so interpreted.

Each participant's factor score was also processed by a program that
calculated the D statistic:

$$D = \sqrt{(A - A')^2 + (B - B')^2 + (C - C')^2}$$

The D statistic is the distance between two points in a three
dimensional Euclidean space. The first point was located by using the
pretreatment factor means as coordinates and the second was determined by
using the post treatment factor means in a similar manner. The D statistic
thus calculated formed the index of change correlated with physiological
activity.
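
A minimal sketch of the D computation is given here for illustration. It is
not the original program; the function name and the factor means used in the
example are hypothetical.

```python
import math
from typing import Sequence


def d_statistic(pre: Sequence[float], post: Sequence[float]) -> float:
    """Euclidean distance between pre and post treatment factor means."""
    if len(pre) != len(post):
        raise ValueError("pre and post must have the same number of factors")
    # Sum the squared shift on each factor, then take the square root.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(pre, post)))


# Hypothetical factor means for one participant on three factors.
pre_means = (2.0, 3.0, 2.5)
post_means = (2.0, 3.3, 2.9)
print(round(d_statistic(pre_means, post_means), 2))
```

Because the shifts are squared, D records only the size of the change in
meaning and not its direction on any single factor.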

Table 3

Experimental Semantic Differential Instrument:
Group Mean Factor Scores

                      Simulation                  Principalship
Subject           Factor    Pre     Post      Factor    Pre     Post

Entire              A      2.44    2.31*        F      2.21    2.20
group               B      3.00    3.48*        G      3.50    3.72
mean                C      2.74    3.24*        H      2.18    2.05

* Significant difference, .1 level, one tailed correlated t test


Physiological Activity





The physiological measures recorded were HR and GSP activity (not level).
The type of recording method allowed only frequency measures for HR, but
both amplitude and frequency could be obtained from the GSP recording. In
addition to the recording during the simulation, two one minute baselines,
one before and one after treatment, were also taken. The baselines permitted
calculation of deviation scores as recommended by Burstein et al. (1965).
These scores reflect the difference between a participant's resting score
and his scores during each minute of simulation. Because deviation scores
calculated in this way compensate for different reaction sensitivities, it
was assumed that such scores are comparable across individuals.

General scoring procedure. The simulation program was structured so
as to allow varied response times. This resulted in varying total simulation
entry-exit times. Accordingly, to equalize the situation for statistical
treatment, the scores were all prorated to the most common duration, namely
19 minutes. Furthermore, since all motion artifact could not be controlled
by electrode placement, sections of the physiological traces were discarded
wherever motion artifact was detected. The remaining portion of the useful
trace was further prorated in proportion to the removed portion.
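
The two prorating steps might be sketched as below. This assumes simple
linear scaling, which the text implies but does not spell out; the function
and parameter names are hypothetical.

```python
COMMON_DURATION_MIN = 19.0  # most common session length reported in the text


def prorate_score(raw_score: float,
                  session_min: float,
                  discarded_min: float = 0.0,
                  target_min: float = COMMON_DURATION_MIN) -> float:
    """Scale a score from the usable trace to the common session length.

    Assumes activity is spread evenly over time (linear prorating).
    """
    usable_min = session_min - discarded_min
    if usable_min <= 0:
        raise ValueError("no usable trace remains after discarding artifact")
    # Step 1: restore the portion lost to motion artifact.
    full_session_score = raw_score * session_min / usable_min
    # Step 2: rescale the whole session to the common duration.
    return full_session_score * target_min / session_min


# Hypothetical example: a 19 minute session with 1 minute discarded.
print(prorate_score(90.0, 19.0, 1.0))
```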

Heart rate. The simplest of the recorded indices, heart rate, was
scored by counting the number of r-spikes in each minute.

GSP. This measure was scored to provide two indices of activity,
amplitude and frequency. The frequency index was defined as the number of
GSP events that occur within a given time. Every trace peak (event) contained
between two points of inflection, either above or below the trace centerline,
was counted to yield this score. This method of scoring was called the
individual method.
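
One way to read the individual method for a digitized trace is sketched
here. The original scoring was done by hand on paper traces, so the sampled
representation, the function name, and the rule that a peak is a local
maximum above the centerline or a local minimum below it are all assumptions
made for illustration.

```python
from typing import Sequence


def count_peaks(trace: Sequence[float], centerline: float = 0.0) -> int:
    """Count peaks above and troughs below the centerline in a sampled trace.

    Flat plateaus are ignored in this simplified sketch.
    """
    count = 0
    for prev, cur, nxt in zip(trace, trace[1:], trace[2:]):
        is_peak_above = cur > prev and cur > nxt and cur > centerline
        is_peak_below = cur < prev and cur < nxt and cur < centerline
        if is_peak_above or is_peak_below:
            count += 1
    return count


# Hypothetical one minute trace, in arbitrary chart units.
sample_trace = [0.0, 1.0, 0.0, -2.0, 0.0, 0.5, 0.0]
print(count_peaks(sample_trace))
```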

The project consultant also suggested an alternative scoring procedure
for the frequency index. This method consisted of analyzing the trace to
determine the number of complete cycles occurring per minute. This scoring
approach was called the cyclic method. Because of the possibility that
mono-, bi-, and tri-phasic responses could occur within the same trace, and
the subsequent difficulty in determining which event had which
characteristic, it seemed that this method was more subjective than the
individual method. However, because the consultant pointed out the
possibility of uncovering relationships due to the non-linearity of
adjustment, his advice was heeded. Consequently, both methods of frequency
analysis were used.

The amplitude index was defined as the sum of the vertical displacements
of each peak from the trace centerline. Note that this index is a total of
all deviations regardless of their directions. Burstein et al. (1965)
support this method when they report that "the most effective SP index of
stimulus induced emotionality is the sum of all vertical deviations from the
baseline" (Page 24).
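
The amplitude index can be illustrated with a short sketch. The function
name and the peak heights are hypothetical; the only rule taken from the
text is that displacements are totaled without regard to direction.

```python
from typing import Iterable


def amplitude_index(peak_heights: Iterable[float], centerline: float = 0.0) -> float:
    """Sum of the vertical displacements of each peak from the centerline."""
    # Absolute values are used so that upward and downward peaks both add.
    return sum(abs(height - centerline) for height in peak_heights)


# Hypothetical peak heights for one minute, in arbitrary chart units.
print(amplitude_index([1.0, -2.0, 0.5]))
```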

Interpretation. Several alternative interpretation and scoring
procedures were made possible by the methods of data collection. A review
of the literature did not yield conclusive evidence, nor even a predominance
of opinion, as to the recommended procedures, so a deductive approach was
taken. It was assumed that the baselines, because they were taken in a
no stimuli situation, represent a resting level of the organism's
physiological activity. Accordingly, when stimuli are injected, as is the
case when the simulation is in progress, the activity level should be
changed. Thus the algebraic sum of the differences between each minute's
score and the baseline yields a gross index of the level of activity, which
because of the baseline reference is comparable among individuals. However,
a further possibility is that any deviation from the baseline, either
positive or negative, is indicative of activity. In this event the algebraic
sum would be inappropriate, and the absolute sum would instead represent a
more correct index of activity.
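
The difference between the two summing rules can be seen in a small sketch.
The function names and the minute-by-minute scores are hypothetical.

```python
from typing import List, Sequence


def deviation_scores(minute_scores: Sequence[float], baseline: float) -> List[float]:
    """Difference between each minute's score and the resting baseline."""
    return [score - baseline for score in minute_scores]


def algebraic_sum(deviations: Sequence[float]) -> float:
    """Signed total: rises and falls relative to baseline cancel out."""
    return sum(deviations)


def absolute_sum(deviations: Sequence[float]) -> float:
    """Unsigned total: any departure from baseline counts as activity."""
    return sum(abs(d) for d in deviations)


# Hypothetical heart rate counts for three minutes and a resting baseline.
devs = deviation_scores([72.0, 68.0, 75.0], baseline=70.0)
print(algebraic_sum(devs), absolute_sum(devs))
```

In this example the signed total is 5 while the unsigned total is 9, since
the minute that fell below baseline offsets part of the rise under the
algebraic rule but adds to the total under the absolute rule.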

Beyond these questions, one further logical issue remains to be resolved.
This issue pertains to the use of the baselines. Because two, one before and
one after treatment, were taken, the two can be averaged to provide a longer
period for a baseline; or, it could be argued that since the second baseline
occurs immediately after the simulation ends, not enough time has elapsed
for the participant to return to a resting state, and that consequently
only the first baseline is a valid indicator of the resting state.

To ensure a comprehensive analysis that encompassed many of the
possibilities of this exploratory investigation, it was decided to use each
of the interpretation methods described in the preceding paragraphs. It was
also felt that this decision would provide the broadest data base upon which
subsequent investigators could build.
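
Taken together, the two baseline choices and the two summing rules give four
versions of each activity index. A sketch of how they could be enumerated
follows; the function name, dictionary keys, and scores are hypothetical and
do not reproduce the study's program.

```python
from itertools import product
from typing import Dict, Sequence, Tuple


def activity_indices(minute_scores: Sequence[float],
                     pre_baseline: float,
                     post_baseline: float) -> Dict[Tuple[str, str], float]:
    """Compute an activity index under each baseline and summing option."""
    baselines = {
        "first_only": pre_baseline,
        "averaged": (pre_baseline + post_baseline) / 2.0,
    }
    summing_rules = {
        "algebraic": lambda devs: sum(devs),
        "absolute": lambda devs: sum(abs(d) for d in devs),
    }
    results = {}
    for (b_name, base), (s_name, rule) in product(baselines.items(),
                                                  summing_rules.items()):
        deviations = [score - base for score in minute_scores]
        results[(b_name, s_name)] = rule(deviations)
    return results


# Hypothetical scores for three minutes with pre and post baselines.
for key, value in activity_indices([72.0, 68.0, 75.0], 70.0, 74.0).items():
    print(key, value)
```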

Secondary Analyses

The main purpose of this stage was to determine the relationships
between physiological activity and change in concept meaning, and then
use this to test the hypotheses. A correlation of each subject's D score
with each of the various indices of physiological activity provided the
necessary test of the hypotheses as well as providing additional insights
as to the effects of simulation. The Pearson Product Moment correlation
coefficient was used in all the calculations shown in Table 4.
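
For reference, the Pearson product moment coefficient between D scores and
one physiological index can be computed as in the following sketch. The
function name and data are hypothetical.

```python
import math
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson product moment correlation between two paired samples."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must be paired and contain at least 2 values")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross products and of squares about the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical D scores and activity index values for four subjects.
d_scores = [1.0, 2.0, 3.0, 4.0]
activity = [1.0, 3.0, 2.0, 4.0]
print(round(pearson_r(d_scores, activity), 2))
```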

Once Table 4 became available and the low correlations were noted,
sample scattergrams were plotted from the same data that yielded the
correlations. These scattergrams, while typical of low correlations, did not
suggest the presence of any severe departure from linearity, a necessary
condition for correlational analyses. In addition, the assumption of
homoscedasticity was reviewed, but because of the small sample it was
difficult to determine the actual condition of this characteristic. Because
of the analytical techniques employed, it was assumed that the condition
was met to a satisfactory degree.



While some of the correlations reported in Table 4 are significant,
these coefficients represent only a relationship between a single
physiological variable and change in concept meaning. As indicated by the
review, however, combinations of physiological variables provide a more
inclusive representation of involvement or the degree of activation.
Consequently, the coefficient of multiple correlation (R) (Tate, 1959,
p. 309) was calculated. The results are reported in Table 5.

The multiple correlation coefficient gives the relationship between
two or more predictor variables, taken as a team, and a single criterion
variable. This study employed both two and three predictor models of this
coefficient. Physiological variables were taken exclusively as predictor
variables and change in concept meaning (D) was taken exclusively as a
criterion. To determine if simplification was possible, two predictor
correlations were calculated and the results presented in Table 6.
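
For the two predictor case, R can be obtained directly from the three
pairwise Pearson correlations using the standard textbook formula. The
sketch below is offered only as an illustration; it is not known to be the
computational procedure used in the study, and the function name and values
are hypothetical.

```python
import math


def multiple_r_two_predictors(r_y1: float, r_y2: float, r_12: float) -> float:
    """Multiple correlation of a criterion with two predictors.

    r_y1, r_y2: correlations of each predictor with the criterion.
    r_12: correlation between the two predictors.
    """
    denominator = 1.0 - r_12 ** 2
    if denominator <= 0:
        raise ValueError("predictors are perfectly correlated; R is undefined")
    r_squared = (r_y1 ** 2 + r_y2 ** 2 - 2.0 * r_y1 * r_y2 * r_12) / denominator
    # Guard against tiny negative values caused by rounding.
    return math.sqrt(max(r_squared, 0.0))


# Hypothetical correlations: two uncorrelated predictors.
print(round(multiple_r_two_predictors(0.3, 0.4, 0.0), 2))
```

R is never smaller than the larger of the two single predictor correlations
in absolute value, and by construction it is never negative.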

FINDINGS & CONCLUSIONS

Semantic Differential

Instrumentation and technique. Analysis of pre and post treatment
semantic differential factor means suggested systematic motion of factor
means rather than a random scattering. This tendency, combined with the
stability of the semantic differential responses, supports the finding that
the semantic differential technique is indeed a very sensitive indicator
of concept meaning. It must be emphasized, however, that this is probably
only true when strict developmental procedures and pilot instruments are
used. Since high reliability correlations were identified, this allows
greater confidence in the D statistic calculated from the factor averages.

Effect of the simulation. It is quite clear that the semantic
differential instrument dealing with the technique of simulation was more
effective than the instrument pertaining to the principalship. It was
determined that the former accounts for more variance attributable to the
three factors than the latter. This finding has the effect of weakening the
usefulness of criterion scores calculated from the principalship instrument.

The pre and post treatment semantic differential instruments dealing
with simulation indicated an average D statistic of .69 units. Figure 2
shows the dimensional components of this value and graphically illustrates
the shift in meaning that resulted from treatment. Factor A, indicative of
the evaluative-potency dimension, changed from 2.44 to 2.31, or .13 units
towards a more positive meaning. Factor B, an index of the activity
dimension, changed from 3.00 to 3.48, or .48 units towards a less active
meaning of simulation. Factor C, an index of adaptability, moved .50 units
towards a less flexible or adaptable meaning.

Analysis of the pre and post treatment semantic differential instruments
pertaining to the principalship yielded an average D statistic of .25 units.
This is a very small and perhaps even insignificant shift. When this
statistic's components were analyzed, no single dimension was found to change
more than .224 units. Therefore, it appeared reasonable to accept the finding
that no change in concept meaning took place. The graphic representation of
the dimensional components of the principalship's D statistic is presented in
Figure 3.

If, as is suggested by the above paragraph, there is no shift in what
the concept of principalship means to the participants, all correlations of
physiological variables with concept meaning change must be insignificant.
Tables 4, 5, and 6 support this finding.

Both D statistics, indicative of shift in concept meaning, share the
characteristic of changing most in the second and third factors. The first
factor of each concept appears not to change very much, if at all. This
finding would suggest that the evaluative aspects of a concept's meaning are
not as easily changed as are those aspects involving activity and potency.

Table 4 did not evidence any repetitive pattern; for example, the deviation
scores calculated from the first baseline did not all correlate highest.
It thereby exposed the finding that the various methods of scoring and
interpretation do not greatly vary the resulting correlations between
physiological activity and concept meaning change. No evidence was found to
favor the use of either the algebraic or absolute method of summing
deviation scores. Using similar reasoning, there is no evidence supporting
or rejecting either the single baseline or the pre and post treatment
baseline method of obtaining deviation scores. However, if the post
treatment baseline is taken shortly after the simulation ends, it is
cautioned that this finding may only be true for those simulations which do
not raise the anxiety level of the participants to an excessive degree.

Relationships between Semantic Differential and Physiological Activity

All relationships described by the multiple correlations in Tables 5 and 6
are positive, as they of course must be. In addition, all significant
correlations between a single physiological variable and the D statistic
for concept meaning change, provided in Table 4, are positive. Generally,
however, all correlations based on a single physiological variable are very
weak. Correlations based on the average GSP response, that is, the bottom
four in Table 4, are distinctly stronger than either frequency or amplitude
correlations alone.

The multiple correlations reported in Table 5 are higher and thus account
for more variance than single physiological variables. Even though this is
the case, the multiple correlations evidenced by the data generated by this
study are interpreted to be weak.

While the individual indices of heart rate, GSPA, and GSPF do not
correlate well with the criterion, a derivative from these scores does. When
GSPR, which is a rough estimate of the average GSP response, is calculated,
it quite clearly correlates better than any other single variable. It must
be pointed out that this is only the case when frequency is calculated by
the cyclic method, as was suggested by the project consultant.

A comparison of Tables 5 and 6 results in the finding that, with one
exception in 16 cases, multiple correlations using three predictor variables
are higher and thereby account for more variance than multiple correlations
with two predictors. It must also be noted that in the two predictor case,
one of the two predictor variables was GSPR, which is, in fact, a composite
indicator resulting from the division of GSPA by GSPF. This results in the
finding that more variance is accounted for when combining three variables
individually than by working with a subgrouping of them.
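
The composite can be written out in a brief sketch. The function name is
hypothetical, and the handling of a minute with no GSP events is an
assumption, since the text does not say how that case was scored.

```python
def gspr(gspa: float, gspf: float) -> float:
    """Rough average GSP response: amplitude index divided by frequency index."""
    if gspf == 0:
        # Assumption: with no events there is no average response to report.
        raise ValueError("GSPR is undefined when the frequency index is zero")
    return gspa / gspf


# Hypothetical amplitude and frequency indices for one participant.
print(gspr(12.0, 4.0))
```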

The multiple correlations, in both three- and two-predictor forms, were
always found to be higher when employing data generated using the cyclic
method of frequency scoring.

Conclusions 

Because this study employed correlation analysis, causal statements
as to the effect of the simulation treatment are not permissible. Beyond
this limitation, however, correlations impose the requirement of
interpretation of the degree of relationship detected. Categorical claims
cannot be supported with these statistics. The following conclusions are
offered within these restrictions.

The study's general hypothesis appears to be acceptable. That is, for
this particular sample, a positive relationship between the degree of
involvement, as indicated by physiological activity, and change in concept
meaning was evidenced. The degree of strength of this relationship is
interpreted to be moderate, but systematically present. Accordingly, the
strength discovered was deemed to be high enough to accept the hypothesis
and to warrant use in the other investigations contemplated for Project CORES.

The multiple correlations clearly support the conclusion that a number
of predictors are of greater value than any single predictor variable. The
predictor selection procedure as well as the final correlations both support
the conclusion that the three physiological variables used in this study
represent a minimum treatment of covert responses to simulation. However, the
small amount of variance (R^2) accounted for by the three predictor variables
clearly suggests the conclusion that other significant factors or influences
on simulation remain to be identified.

The findings allow the conclusion that differing methods of
interpretation, scoring, and summing of deviation scores may provide
meaningful information. This same data also requires the conclusion that the
cyclic method of scoring frequency is more useful than the individual method.